Goto

Collaborating Authors

 pathological voice


Analysis and Detection of Pathological Voice using Glottal Source Features

Kadiri, Sudarsana Reddy, Alku, Paavo

arXiv.org Artificial Intelligence

Automatic detection of voice pathology enables objective assessment and earlier intervention for the diagnosis. This study provides a systematic analysis of glottal source features and investigates their effectiveness in voice pathology detection. Glottal source features are extracted using glottal flows estimated with the quasi-closed phase (QCP) glottal inverse filtering method, using approximate glottal source signals computed with the zero frequency filtering (ZFF) method, and using acoustic voice signals directly. In addition, we propose to derive mel-frequency cepstral coefficients (MFCCs) from the glottal source waveforms computed by QCP and ZFF to effectively capture the variations in glottal source spectra of pathological voice. Experiments were carried out using two databases, the Hospital Universitario Principe de Asturias (HUPA) database and the Saarbrucken Voice Disorders (SVD) database. Analysis of features revealed that the glottal source contains information that discriminates normal and pathological voice. Pathology detection experiments were carried out using support vector machine (SVM). From the detection experiments it was observed that the performance achieved with the studied glottal source features is comparable or better than that of conventional MFCCs and perceptual linear prediction (PLP) features. The best detection performance was achieved when the glottal source features were combined with the conventional MFCCs and PLP features, which indicates the complementary nature of the features.


Pathological Voice Classification Using Mel-Cepstrum Vectors and Support Vector Machine

Pishgar, Maryam, Karim, Fazle, Majumdar, Somshubra, Darabi, Houshang

arXiv.org Machine Learning

Abstract--Vocal disorders have affected several patients all over the world. Due to the inherent difficulty of diagnosing vocal disorders without sophisticated equipment and trained personnel, a number of patients remain undiagnosed. To alleviate the monetary cost of diagnosis, there has been a recent growth in the use of data analysis to accurately detect and diagnose individuals for a fraction of the cost. We propose a cheap, efficient and accurate model to diagnose whether a patient suffers from one of three vocal disorders on the FEMH 2018 challenge. I. INTRODUCTION The human standard of life can be severely affected by their individual pathological voice condition. This has also financially burdened several patients, organizations, and societies [1].Some of the common impairments to the voice are structural lesions, neoplasms, and neurogenic disorders [1].